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Abstract 

We consider flow rounding: finding an integral flow from a fractional flow. Costed flow rounding asks 
that we find an integral flow with no worse cost. Randomized flow rounding requires we randomly find 

an integral flow such that the expected flow along each edge matches the fractional flow. Both problems 

2 

are reduced to cycle canceling , for which we develop an 0(m log ^-) algorithm. 


1 Introduction 

Many modern network flow algorithms give solutions with fractional flow values, but often we’re interested 
in an integral assignment of flow. For any costed fractional flow with integral capacities, we can always 
change it to an integral flow with the same (or better) flow value and no worse cost. Further, we can always 
randomly find an integral solution so that the expected flow on each edge matches the fractional solution. 

Given a fractional solution as a starting point, we refer to finding an integral flow with no worse cost 
as costed flow rounding, and preserving the expected amount of flow along each edge as randomized flow 
rounding. Several authors have given O(mlogm) algorithms for costed flow rounding - notably a scaling 
algorithm that has (9(log m) parallel runtime (TJ, approaches for rounding max-flow solutions using just 
random walks 0 0- However, beyond a special case in Q where we round a unit acyclic flow, only the 
0{m 2 ) path decomposition of 0 is known for randomized flow rounding. 

Our approach relies on an observation about edges with fractional flow: in a circulation, every fractional 
edge must be in a cycle of fractional edges. Formalized in section [3] this means we can repeatedly find and 
cancel fractional cycles to yield an integral circulation. 

When canceling a given cycle, we can push flow in either direction. In costed flow rounding, at least 
one direction won’t increase overall cost. To approach randomized flow rounding, section [3] will show that 
simple random choices of direction preserve the expected flow along each edge. 

For the cycle cancelling problem, our main results are a practical ()(n 2 ) algorithm, and an 0{m log ^-) 
algorithm that smoothly combines the 0{n 2 ) approach with an 0(m log/i) dynamic tree solution given in 
l8l . This answers the conjecture of Goldberg and Tarjan 0, improving the speed of their minimum-mean 
cycle-cancelling algorithm and other rounding approaches involving cycle cancelling such as f2|. 

We begin with brief background on the dynamic tree data structure of 0 and flows, followed by the 
reductions of flow rounding to cycle canceling, and then present each cycle canceling algorithm in turn. 
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2 Background 

2.1 Dynamic trees 
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Both the 0(mlogn ) and 0(m log —) algorithms presented in this paper utilize the dynamic trees data struc¬ 
ture of (81. The original paper can provide greater detail, but we recall the relevant points. 

The data structure allows the maintenance of a dynamic forest of rooted trees over a set of nodes, and 
tree-path operations over the forest. Each relevant operation takes time logarithmic size of the trees involved. 
Generally, this means we assume O(logn) time per operation, but when we restrict the size of each tree to k 
in section [6] this will guarantee operations take Of log A:) time. 

Specific supported operations we will need are: 

• Link(u,v): Create an edge between u and v, making v a child of u. 

• Cut(u,v ): Remove the edge between u and v. 

• FindRoot(v ): Find the root of a tree which v belongs to. 

• PathAdd '(«, v,c): Add a number c to the weight of every edge along the u-v path. 

• PothMin(u.v): Report the edge with minimum weight over the u-v path. Break ties by reporting the 
edge closest to u. 

• PathSum(u,v): Report the sum of edge weights over the u-v path. 

2.2 Flows and circulations 

For the remainder of the paper, we will focus on circulations, flows in which no node has excesses and 
deficits. This is justified by reducing other cases to circulations. 

In the case of costed flow rounding, we will be given a fractional circulation over a costed graph, and 
attempt to find an integral circulation with no worse cost. Note that this allows us to round max-flow and 
min-cost max-flow solutions as well: we connect the sink to the source with an edge that has cost — °o to 
find a circulation, and this edge guarantees we won’t decrease the source-sink flow. 

For randomized flow rounding, to round a flow we can again link the sink to the source with an edge e 
that has flow value F, creating a circulation. After rounding the circulation, the expected flow along e will 
be F, so the source-sink flow is preserved in expectation. Further, the algorithms in this paper change the 
flow along each edge by at most 1 unit, so after rounding the source-sink flow will be between [Fj and \F~\. 
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3 Flow Rounding using Cycle Cancelling 

3.1 Fractional Cycles 

The following key lemmas motivate solving flow rounding using cycle cancelling. We assume that we are 
working with a graph with integral capacities throughout. 

Lemma 1 . If a circulation f has the property that the subgraph of all fractional edges forms a forest, then 
f is an integral circulation. 

Proof. Suppose not. Then there must be a leaf node v in the forest with only one connected edge that has 
fractional flow. The net flow into v cannot be zero, as its remaining edges have integral flow, which violates 
the net flow condition on a circulation. □ 

Lemma 2. If a circulation f contains no cycle of edges with fractional flow, then f is integral. 

Proof. A graph without cycles is a forest, therefore by Lemma [T]/ is an integral circulation. □ 

Now, suppose we begin with a fractional circulation /, from which we will construct a new circulation 
f (initially equal to /). If we cancel all fractional cycles in /', then f must be integral. To cancel a given 
cycle, we can push flow around it until the flow across an edge becomes integral. 

To capture the amount of flow required to make an edge integral, we define the availability of a directed 
edge (u,v) as: 

availability(u, v) = \f(u,v)~\ —f'(u,v) 

We can push exactly availability (w, v) units of flow along (u, v) before /'(«, v) becomes integral. This defini¬ 
tion applies for both directions along the edge. For example, if/'(w,v) =f(u,v ) = U, then availability(u,v) = 
0.3 and availability(v,u) = (—1) — (—1.7) = 0.7, indicating we can push 0.3 units of flow from u to v or 
0.7 units of flow from v to u before the flow along the edge becomes integral. Similarly, we define the 
availability of a directed path (or directed cycle) to be the minimum availability of edges along the path. 
This is the most flow we can push along a path before an edge becomes integral. 

With the concept of availability, the next lemma guarantees that we can cancel cycles without violating 
capacity constraints. 

Lemma 3. If an edge (u, v) satisfies availibil ity(u. v) > 0 and availability (v, u) > 0, then the new circulation 
f obeys the capacity constraints of the edge. 

Proof. If so, we have |~/(w,v)~| — f'(u. v) > 0 and \f(v,u)~\ — f'(v,u) > 0. Using antisymmetry, this gives us 
\f(u,v)~\ > f'{u,v) and L/(«,v)J < /(w,v). As the capacity of (u,v) is an integer, any flow value between 
\f(u, v)~| and [f(u, v)J must satisfy its capacity constraint, given that f(u,v) does. So f obeys the capacity 
constraint. □ 

If we only ever push flow along paths that is equal to the availability of the path, no edge can have its 
availability drop below zero. So the cycle cancelling method just described will yield an integral circulation 
f that obeys the capacity constraints. 

3.2 Costed Flow Rounding 

In the problem of costed flow rounding, edges have associated costs per unit flow. Given a fractional cir¬ 
culation /, we want to find an integral circulation f that has no worse cost than /. This is achieved by 
exploiting the choice of direction we can make when cancelling a cycle: we can make an edge on the cycle 
integral by pushing flow in either direction, so we can choose the direction that yields better cost. As cost is 
antisymmetric, either one direction has positive cost and the other has negative cost, or both directions have 
zero cost. In either case, we can always cancel the cycle without increasing the cost of the circulation. 
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3.3 Randomized Flow Rounding 

In our characterization of the randomized flow rounding problem, we are given a fractional circulation / 
with source s and sink t, and wish to randomly find an integral circulation f such that for every edge (u. v), 
Ex[f(«,v)]=/(«,v)[| We say that a procedure preserves flow in expectation if this is true. 

One known algorithm for randomized flow rounding is the path stripping algorithm originally proposed 
by J6l. Their analysis of using path stripping to approximate integer multicommodity flow problems holds 
true for any algorithm that solves the randomized flow rounding problem, as it relies solely on the fact that 
the flow is preserved in expectation. The path stripping algorithm runs in 0(m 2 ) time, and we can improve 
that with another adaptation of cycle cancelling. The idea will be to randomly choose the direction in which 
to cancel each fractional cycle, such that the flow is preserved in expectation. First, we show that we can 
compose operations whilst preserving the flow in expectation. 

Lemma 4. Let f he a random variable giving an initial flow. If p is a procedure that presen’es flow in 
expectation , then Ex[p(f)] = Ex[f], 

Proof. Because p preserves flow in expectation, Ex[p (g) — g] = 0 for a fixed flow g. Then, 

Ex[p (f) - f] = £Ex[p (f) - f|f = g]Pr [f = g] 

g 

= £ Ex [p(g)-<?] pr [ f =s] 

g 

= 0 


□ 

Lemma 5. Ifp\ , P 2 ,... ,p„ are procedures that preserve flow in expectation, then so does their composition. 
Proof. Let f 0 be the original flow, t'i =Pi(/ 0 )T 2 = p 2 (fi),... ,f„ = p„(f„_i). 

By Lemma[4] Ex[f; + i] = Ex[f;], so Ex[f„] = /o. □ 

Now, if we can show how to cancel a cycle whilst preserving flow in expectation, any sequence of these 
will preserve flow in expectation. Suppose that a given cycle has availability a forward and b backward. 
That is, if we cancel it by pushing flow forward we will add a units of flow to every edge, and pushing 
flow backward will subtract b units. Fix a particular edge on the cycle, initially with flow x, and with final 
flow X. If we cancel forward with probability p, then Ex[X] = x + pa — (1 — p)b = x + p(a + b) —b. So, 
choosing p = Ex[X] = x. Note that p is independent of x: we can choose p independently of the flow 
on the particular edges of the cycle, and every edge will have its value preserved in expectation. So by 
Lemma [5] we can repeatedly cancel cycles in this way to obtain an integral flow whilst preserving the flow 
in expectation, solving the randomized rounding problem. 

As such, our O(mlogy-) cycle cancelling algorithm directly improves on the 0(m 2 ) path stripping 
approach for randomized rounding, with applications in global routing |6j|. 


*Ex[X] will denote the expected value of a random variable X, and Pr[E] will denote the probability of an event E. Random 
variables are bolded. 
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4 Rounding in O [m log n) 

In this section, we present our appropriation of Sleator and Tarjan’s algorithm for making a flow acyclic 
lf8l into an 0(m log/;) algorithm for cycle cancelling. We present the algorithm in the context of rounding 
a costed circulation, but any method for cancelling cycles that only needs path aggregation can be handled. 
For flow rounding, the algorithm adds fractional edges one by one and cancels cycles as they occur. 

Initially we are given a fractional circulation / over a costed graph G = (V,E), and initialize our new 
circulation as f = /. We will build a graph G' = (V. F') incrementally, initially with no edges. We initialize 
a dynamic trees data structure, with every node as a single-node tree, that will represent the fractional edges 
in G' at all times. The data structure will keep track of the cost of each edge, flow along each edge, and 
availabilities in both directions. 

We proceed to add edges (w,v) E E with fractional flow f'(u,v) to G' one by one. If FindRoot(u ) = 
Find Root (v), a path from u to v exists and we have a fractional cycle (as in figure [la|). We find the sum of 
costs along the cycle, pick a direction in which cost is non-negative, and find the minimum availability in 
that direction. Then we push this amount of flow around the cycle by adding to the u-v path and f'(u,v). 

Now there must be at least one edge on the cycle with zero availability, and hence integral flow (as in 
figure [Tb|). If f(u,v ) is integral, we simply don’t add it to the dynamic trees. If any edge on the u-v path 
is integral, a minimum availibility query will yield an edge (x,y) with zero availability. We update f(x,y) 
with the flow stored in the data structure, and remove (x, y) (figure [Tc]). If any integral edges remain, we 
can find them with a minimum availability queries along the u-x path and the y-v path, and remove them 
recursively. 



(a) H and E are in the same tree, so 
adding (H,E) forms a cycle. 



(b) (B,C) has the smallest availabil¬ 
ity, so we push 0.2 units around. 



(c) Decreasing availabilities, (B,C) 
becomes integral and is cut. 


Figure 1: Cancelling a cycle in the 0{m\ogn) algorithm. 


After adding all the edges to G there are no cycles of fractional edges, and so by [5] f must be an 
integral circulation. Further, the availability of each edge never moves below zero in either direction, so by 
[3]/' satisfies all capacity constraints. Each dynamic trees operation used take O(logn) time, and we use a 
constant number of operations to add and remove each edge. As such, the total running time is 0{m\ogn). 
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5 Rounding in 0(n 2 ) 

In the 0(mlogn) algorithm, we processed edges in an arbitrary order. In this section we show how to process 
all edges from the same node in one batch, cancelling all the cycles introduced at once in 0(n ) time per node. 

We maintain a forest of processed nodes initially empty. To process a node x, we consider its fractional 
edges that are connected to trees in the forset. If x has more than 1 edge to a tree, then cycles will be 
formed. We describe a recursive algorithm. Cancel (u), that removes cycles involving the subtree of u. After 
invocation, we guarantee that there is at most one path to x through the subtree left. 

To perform Cancel (u), we first call Cancel (v) for each child v of u. Each call to a child will cancels 
cycles, and returns information about the single remaining path to x if exists. If more than one of u and 
its children have paths left to x, there will be a cycle consisting of two disjoint paths to x. Let the paths be 
pathDown and pathUp, such that patliDown is a u-x path, pathUp is an x-u path. 

At this point, we can cancel the cycle as before based off the aggregate information. In the case of 
costed flow rounding, we find cost (pathDown) + cost (pathUp), and swap the two if the cost is positive. 
Then we will send flow F equal to the minimum availability of pathDown and pathUp around the cycle, 
down pathDown , and up pathUp, and update the availabilities of each path. (Actually updating the flow 
values on each edge is deferred until later.) 

After cancelling a cycle, at least one of pathDown and pathUp will now have zero availability in one 
direction. We remove such paths (we will remove the integral edges later). We repeat this until at most one 
path from u to x remains, and return the new aggregate information. In particular, if p is the parent of u, we 
can compute the new aggregates for the path p-x using information about the edge (p,u) and the path p-x. 

To send flow around all of the cycles through x in 0(n) time, we push flow along paths in the tree in a 
batch operation. If we’re at a node u and want to push F units of flow down some pathDown and up some 
pathUp, we immediately update the edges between the paths and x, mark the last node of pathDown with 
—F, and mark the last node of pathUp with F. After running Cancel, we run a procedure UpdateFlow(u) 
on the root of each tree that does the following: 

• Let incomingFlow initially be the sum of the marked values of u. 

• Lor each child v of u: 

- Call UpdateFlow(v). Let the result be F v . 

- Subtract F v from f(u,v). 

- If f'(u,v) is now integral, remove it from the forest of fractional edges. 

- Add F v to incomingFlow. 

• Return incomingFlow. 

At this point, we can add x to the forest, as no cycles remain. Each above procedure takes time linear in 
the size of the forest, so adding every node takes 0(n 2 ) time. At termination, the fractional edges must form 
a forest, so by [2] the circulation is integral. 

Ligure[2]illustrates some of the steps in processing a node X with edges into the existing tree. The figures 
[2](a)(b)(c) depict Cancel(H), and the figures |2](d)(e) illustrate Cancel(C). 
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(a) In Cancel(H), cancelling the labelled cycle. (b) In Cancel(H), cancelling the cycle through L. 




(c) The result when Cancel ( H) returns. (d) Cancel (C) now cancels the cycle through F and H. 



(e) The result when Cancel(C) returns. 

Note available (X ,H) = 0. 

Figure 2: Illustration of the 0(n 2 ) algorithm. 
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6 Rounding in O (m log ^) 

Combining the dynamic trees of the 0(mlogn) algorithm and the batch processing of the Off) algorithm, 

2 

it’s possible to achieve an Ofn log —) algorithm for cycle canceling. 

The basic idea is to represent the forest of processed nodes maintained by the Off) algorithm as trees 
of clusters, where each cluster has a limited number of nodes, say less than 2k, and is represented using the 
dynamic trees data structure. We call a tree of clusters a primary tree, and will represent the primary trees 
by storing parent pointers in the root of each cluster. Each pointer gives the parent node, from which we can 
find the parent cluster. By trading off the size of the clusters against that of the primary trees we achieve the 
stated speedup. 

To keep the size of clusters in the required range, whenever we walk up a primary tree we will merge 
clusters with their parents if both contain less than k nodes. After such an operation, the following key 
properties hold: 

1. Each cluster has at most 2k nodes. 

2. For any two adjacent clusters, one must have more than k nodes. 

3. The number of internal (non-leaf) clusters in the primary trees is Ofi/k). 

Proof. Consider decomposing the primary trees into sets of paths with one leaf cluster per path. This 
can be done for a particular tree by: numbering leaf clusters from left to right; choosing the first path 
to be from the first leaf to the root; choosing successive paths to be from the next leaf cluster to its 
lowest common ancestor with the previous leaf cluster. 

Indexing all the paths found this way, let x, be the number of internal clusters in the z'-th path (so x; + 1 
will be the path length). Let zz, be the total number of real nodes (not clusters) contained in the z'-th 
path. For every two adjacent clusters on a path, there are at least k nodes. Thus n t > |_(x,- + 1)/2J k > 
Xik/2. So £x; < n/k, and thus the number of internal clusters of the primary trees is Ofi/k). □ 

Now we are in a position to describe the algorithm. As in the Off) algorithm, we proceed to add nodes 
one at a time to a processed set. Suppose we are adding the node x, and the number of edges from x to the 
processed nodes is d. 

Step 1 Merge clusters. 

We identify all the clusters with edges to x, and follow any parent pointers upward to discover the 
primary trees in which we might have introduced cycles. As we follow parent pointers, we merge 
adjacent clusters if both have less than k nodes. After merges, we will have reached 0(d + n/k) 
clusters in the search - at most 0{n/k) internal nodes and d leaves. If there were s merges, then this 
takes 0({s + d + n/k) log A) time (as we need O(logk) time to merge and traverse primary tree edges). 

Step 2 Process cycles within primary trees. 

Now we will use the DFS approach in our Off) algorithm on the primary trees found in Step 1. 
Note that each cluster in a primary tree can now have multiple edges to x, but the approach extends 
naturally. The algorithm needs to traverse each primary tree involved, and cancel one cycle for each 
of the d edges. Canceling each cycle will use a constant number of path queries within the clusters, 
as will moving between nodes in the primary tree. As each query takes Ofogk) time, this takes 
0((d + n/k) log A) time overall. Figure [5] illustrates this step. 


Step 3 Update parent pointers and link clusters to x. 

After Step 2, we’ve updated the flow on all involved edges, and removed the parent pointers and 
edges in clusters that have integral flow. No cycles remain, as v has at most one edge to each tree. To 
construct the new forest, we initialize a new cluster for v and add parent pointers to it from the nodes 
v is connected to. Now x will be the new root for several trees, and we will need to reverse some of 
the parent pointers. These changes will involve changing the cluster roots of 0(d) clusters (those that 
v remains connected to) and reversing at most 0(d + n/k) parent pointers, taking 0((d + n/k) log /c) 
time. 

Overall, we add a node x with degree d and s merges in 0{(s + d + n/k) \ogk) time. The total number of 
merges (over all nodes) and sum of degrees arc 0(m), so the cost of adding every node is 0((m + n 2 /k) log A:). 
Choosing k=^, this is 0(mlog ^). 




(a) Before processing cycles within each tree. 




(b) After cancelling cycles. Tree 2 is split, and x has at most one edge to each tree. 
Figure 3: Step 2 of the 0(m\og(n 2 /m)) algorithm. 
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